Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 8359 |
| Missing cells | 24088 |
| Missing cells (%) | 18.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.0 MiB |
| Average record size in memory | 128.0 B |
Variable types
| NUM | 9 |
|---|---|
| CAT | 7 |
Name has a high cardinality: 6231 distinct values | High cardinality |
Publisher has a high cardinality: 295 distinct values | High cardinality |
User_Score has a high cardinality: 88 distinct values | High cardinality |
Developer has a high cardinality: 1126 distinct values | High cardinality |
Global_Sales is highly correlated with NA_Sales and 1 other fields | High correlation |
NA_Sales is highly correlated with Global_Sales | High correlation |
EU_Sales is highly correlated with Global_Sales | High correlation |
Year_of_Release has 84 (1.0%) missing values | Missing |
Critic_Score has 4383 (52.4%) missing values | Missing |
Critic_Count has 4383 (52.4%) missing values | Missing |
User_Score has 3528 (42.2%) missing values | Missing |
User_Count has 4660 (55.7%) missing values | Missing |
Developer has 3489 (41.7%) missing values | Missing |
Rating has 3561 (42.6%) missing values | Missing |
Other_Sales is highly skewed (γ1 = 24.74795541) | Skewed |
Name is uniformly distributed | Uniform |
NA_Sales has 2311 (27.6%) zeros | Zeros |
EU_Sales has 3002 (35.9%) zeros | Zeros |
JP_Sales has 4807 (57.5%) zeros | Zeros |
Other_Sales has 3218 (38.5%) zeros | Zeros |
Reproduction
| Analysis started | 2020-12-05 17:19:27.274453 |
|---|---|
| Analysis finished | 2020-12-05 17:19:56.032107 |
| Duration | 28.76 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 6231 |
|---|---|
| Distinct (%) | 74.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.3 KiB |
| Ratatouille | 9 |
|---|---|
| LEGO Marvel Super Heroes | 9 |
| LEGO Jurassic World | 8 |
| Lego Batman 3: Beyond Gotham | 8 |
| Cars | 8 |
| Other values (6226) |
| Value | Count | Frequency (%) | |
| Ratatouille | 9 | 0.1% | |
| LEGO Marvel Super Heroes | 9 | 0.1% | |
| LEGO Jurassic World | 8 | 0.1% | |
| Lego Batman 3: Beyond Gotham | 8 | 0.1% | |
| Cars | 8 | 0.1% | |
| LEGO The Hobbit | 8 | 0.1% | |
| LEGO Harry Potter: Years 5-7 | 8 | 0.1% | |
| The LEGO Movie Videogame | 8 | 0.1% | |
| LEGO Star Wars II: The Original Trilogy | 7 | 0.1% | |
| Star Wars The Clone Wars: Republic Heroes | 7 | 0.1% | |
| Other values (6221) | 8279 | 99.0% |
Frequencies of value counts
Unique
| Unique | 5015 ? |
|---|---|
| Unique (%) | 60.0% |
Histogram of lengths of the category
Length
| Max length | 132 |
|---|---|
| Median length | 22 |
| Mean length | 23.98265343 |
| Min length | 2 |
Platform
Categorical
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.3 KiB |
| DS | |
|---|---|
| PS2 | |
| Wii | |
| PS3 | |
| PSP | |
| Other values (26) |
| Value | Count | Frequency (%) | |
| DS | 1106 | 13.2% | |
| PS2 | 1104 | 13.2% | |
| Wii | 645 | 7.7% | |
| PS3 | 643 | 7.7% | |
| PSP | 642 | 7.7% | |
| X360 | 588 | 7.0% | |
| PS | 512 | 6.1% | |
| GBA | 445 | 5.3% | |
| PC | 439 | 5.3% | |
| XB | 371 | 4.4% | |
| Other values (21) | 1864 | 22.3% |
Frequencies of value counts
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.786577342 |
| Min length | 2 |
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 84 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2006.393716 |
|---|---|
| Minimum | 1980 |
| Maximum | 2017 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 1980 |
|---|---|
| 5-th percentile | 1995 |
| Q1 | 2003 |
| median | 2007 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2017 |
| Range | 37 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.099620895 |
|---|---|
| Coefficient of variation (CV) | 0.003040091706 |
| Kurtosis | 2.066641924 |
| Mean | 2006.393716 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -1.100803253 |
| Sum | 16602908 |
| Variance | 37.20537506 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=38)
| Value | Count | Frequency (%) | |
| 2008 | 735 | 8.8% | |
| 2009 | 696 | 8.3% | |
| 2010 | 645 | 7.7% | |
| 2007 | 625 | 7.5% | |
| 2011 | 555 | 6.6% | |
| 2006 | 510 | 6.1% | |
| 2005 | 471 | 5.6% | |
| 2003 | 419 | 5.0% | |
| 2004 | 404 | 4.8% | |
| 2002 | 375 | 4.5% | |
| Other values (28) | 2840 | 34.0% |
| Value | Count | Frequency (%) | |
| 1980 | 4 | < 0.1% | |
| 1981 | 34 | 0.4% | |
| 1982 | 23 | 0.3% | |
| 1983 | 14 | 0.2% | |
| 1984 | 9 | 0.1% |
| Value | Count | Frequency (%) | |
| 2017 | 3 | < 0.1% | |
| 2016 | 260 | 3.1% | |
| 2015 | 304 | 3.6% | |
| 2014 | 284 | 3.4% | |
| 2013 | 279 | 3.3% |
Genre
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.3 KiB |
| Action | |
|---|---|
| Role-Playing | |
| Misc | |
| Sports | |
| Adventure | |
| Other values (7) |
| Value | Count | Frequency (%) | |
| Action | 1743 | 20.9% | |
| Role-Playing | 912 | 10.9% | |
| Misc | 905 | 10.8% | |
| Sports | 879 | 10.5% | |
| Adventure | 741 | 8.9% | |
| Shooter | 584 | 7.0% | |
| Platform | 565 | 6.8% | |
| Racing | 525 | 6.3% | |
| Fighting | 440 | 5.3% | |
| Strategy | 390 | 4.7% | |
| Other values (2) | 675 | 8.1% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 12 |
|---|---|
| Median length | 6 |
| Mean length | 7.273238426 |
| Min length | 4 |
| Distinct | 295 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.3 KiB |
| THQ | |
|---|---|
| Nintendo | |
| Sony Computer Entertainment | |
| Sega | |
| Take-Two Interactive | 422 |
| Other values (290) |
| Value | Count | Frequency (%) | |
| THQ | 715 | 8.6% | |
| Nintendo | 706 | 8.4% | |
| Sony Computer Entertainment | 687 | 8.2% | |
| Sega | 638 | 7.6% | |
| Take-Two Interactive | 422 | 5.0% | |
| Capcom | 386 | 4.6% | |
| Atari | 367 | 4.4% | |
| Tecmo Koei | 348 | 4.2% | |
| Warner Bros. Interactive Entertainment | 235 | 2.8% | |
| Square Enix | 234 | 2.8% | |
| Other values (285) | 3621 | 43.3% |
Frequencies of value counts
Unique
| Unique | 93 ? |
|---|---|
| Unique (%) | 1.1% |
Histogram of lengths of the category
Length
| Max length | 38 |
|---|---|
| Median length | 10 |
| Mean length | 12.83263548 |
| Min length | 3 |
| Distinct | 345 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.71994258 |
|---|---|
| Minimum | 0 |
| Maximum | 4136 |
| Zeros | 2311 |
| Zeros (%) | 27.6% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 8 |
| Q3 | 25 |
| 95-th percentile | 125 |
| Maximum | 4136 |
| Range | 4136 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 104.3499353 |
|---|---|
| Coefficient of variation (CV) | 3.396814139 |
| Kurtosis | 472.0553993 |
| Mean | 30.71994258 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 17.0044509 |
| Sum | 256788 |
| Variance | 10888.909 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 2311 | 27.6% | |
| 2 | 280 | 3.3% | |
| 4 | 262 | 3.1% | |
| 3 | 249 | 3.0% | |
| 7 | 249 | 3.0% | |
| 6 | 247 | 3.0% | |
| 5 | 243 | 2.9% | |
| 8 | 235 | 2.8% | |
| 1 | 233 | 2.8% | |
| 9 | 208 | 2.5% | |
| Other values (335) | 3842 | 46.0% |
| Value | Count | Frequency (%) | |
| 0 | 2311 | 27.6% | |
| 1 | 233 | 2.8% | |
| 2 | 280 | 3.3% | |
| 3 | 249 | 3.0% | |
| 4 | 262 | 3.1% |
| Value | Count | Frequency (%) | |
| 4136 | 1 | < 0.1% | |
| 2908 | 1 | < 0.1% | |
| 2693 | 1 | < 0.1% | |
| 2320 | 1 | < 0.1% | |
| 1568 | 1 | < 0.1% |
| Distinct | 254 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.06771145 |
|---|---|
| Minimum | 0 |
| Maximum | 2896 |
| Zeros | 3002 |
| Zeros (%) | 35.9% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 12 |
| 95-th percentile | 67 |
| Maximum | 2896 |
| Range | 2896 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 60.93694708 |
|---|---|
| Coefficient of variation (CV) | 3.792509423 |
| Kurtosis | 689.6999538 |
| Mean | 16.06771145 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 19.50701852 |
| Sum | 134310 |
| Variance | 3713.311519 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3002 | 35.9% | |
| 1 | 697 | 8.3% | |
| 2 | 617 | 7.4% | |
| 3 | 425 | 5.1% | |
| 4 | 329 | 3.9% | |
| 5 | 275 | 3.3% | |
| 6 | 210 | 2.5% | |
| 7 | 175 | 2.1% | |
| 8 | 157 | 1.9% | |
| 9 | 130 | 1.6% | |
| Other values (244) | 2342 | 28.0% |
| Value | Count | Frequency (%) | |
| 0 | 3002 | 35.9% | |
| 1 | 697 | 8.3% | |
| 2 | 617 | 7.4% | |
| 3 | 425 | 5.1% | |
| 4 | 329 | 3.9% |
| Value | Count | Frequency (%) | |
| 2896 | 1 | < 0.1% | |
| 1276 | 1 | < 0.1% | |
| 1095 | 1 | < 0.1% | |
| 1093 | 1 | < 0.1% | |
| 919 | 1 | < 0.1% |
| Distinct | 230 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.30888862 |
|---|---|
| Minimum | 0 |
| Maximum | 1022 |
| Zeros | 4807 |
| Zeros (%) | 57.5% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 6 |
| 95-th percentile | 53 |
| Maximum | 1022 |
| Range | 1022 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 41.21591524 |
|---|---|
| Coefficient of variation (CV) | 3.644559303 |
| Kurtosis | 117.4813734 |
| Mean | 11.30888862 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.949449848 |
| Sum | 94531 |
| Variance | 1698.751669 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 4807 | 57.5% | |
| 2 | 398 | 4.8% | |
| 1 | 370 | 4.4% | |
| 3 | 281 | 3.4% | |
| 4 | 216 | 2.6% | |
| 5 | 171 | 2.0% | |
| 6 | 161 | 1.9% | |
| 8 | 128 | 1.5% | |
| 7 | 125 | 1.5% | |
| 9 | 88 | 1.1% | |
| Other values (220) | 1614 | 19.3% |
| Value | Count | Frequency (%) | |
| 0 | 4807 | 57.5% | |
| 1 | 370 | 4.4% | |
| 2 | 398 | 4.8% | |
| 3 | 281 | 3.4% | |
| 4 | 216 | 2.6% |
| Value | Count | Frequency (%) | |
| 1022 | 1 | < 0.1% | |
| 720 | 1 | < 0.1% | |
| 681 | 1 | < 0.1% | |
| 650 | 1 | < 0.1% | |
| 604 | 1 | < 0.1% |
| Distinct | 122 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.241057543 |
|---|---|
| Minimum | 0 |
| Maximum | 1057 |
| Zeros | 3218 |
| Zeros (%) | 38.5% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 21 |
| Maximum | 1057 |
| Range | 1057 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 22.94153141 |
|---|---|
| Coefficient of variation (CV) | 4.377271423 |
| Kurtosis | 909.5805874 |
| Mean | 5.241057543 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 24.74795541 |
| Sum | 43810 |
| Variance | 526.3138633 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 3218 | 38.5% | |
| 1 | 1691 | 20.2% | |
| 2 | 812 | 9.7% | |
| 3 | 473 | 5.7% | |
| 4 | 340 | 4.1% | |
| 5 | 251 | 3.0% | |
| 6 | 201 | 2.4% | |
| 7 | 183 | 2.2% | |
| 8 | 127 | 1.5% | |
| 9 | 90 | 1.1% | |
| Other values (112) | 973 | 11.6% |
| Value | Count | Frequency (%) | |
| 0 | 3218 | 38.5% | |
| 1 | 1691 | 20.2% | |
| 2 | 812 | 9.7% | |
| 3 | 473 | 5.7% | |
| 4 | 340 | 4.1% |
| Value | Count | Frequency (%) | |
| 1057 | 1 | < 0.1% | |
| 844 | 1 | < 0.1% | |
| 753 | 1 | < 0.1% | |
| 396 | 1 | < 0.1% | |
| 329 | 1 | < 0.1% |
| Distinct | 517 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63.37181481 |
|---|---|
| Minimum | 1 |
| Maximum | 8253 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 18 |
| Q3 | 51 |
| 95-th percentile | 238.1 |
| Maximum | 8253 |
| Range | 8252 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 199.3948557 |
|---|---|
| Coefficient of variation (CV) | 3.146428051 |
| Kurtosis | 432.2219434 |
| Mean | 63.37181481 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 15.53653482 |
| Sum | 529725 |
| Variance | 39758.30849 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2 | 548 | 6.6% | |
| 3 | 407 | 4.9% | |
| 1 | 316 | 3.8% | |
| 4 | 314 | 3.8% | |
| 5 | 305 | 3.6% | |
| 7 | 264 | 3.2% | |
| 6 | 262 | 3.1% | |
| 9 | 235 | 2.8% | |
| 8 | 227 | 2.7% | |
| 11 | 199 | 2.4% | |
| Other values (507) | 5282 | 63.2% |
| Value | Count | Frequency (%) | |
| 1 | 316 | 3.8% | |
| 2 | 548 | 6.6% | |
| 3 | 407 | 4.9% | |
| 4 | 314 | 3.8% | |
| 5 | 305 | 3.6% |
| Value | Count | Frequency (%) | |
| 8253 | 1 | < 0.1% | |
| 4024 | 1 | < 0.1% | |
| 3552 | 1 | < 0.1% | |
| 3277 | 1 | < 0.1% | |
| 3137 | 1 | < 0.1% |
| Distinct | 78 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 4383 |
| Missing (%) | 52.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.18762575 |
|---|---|
| Minimum | 19 |
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 43.75 |
| Q1 | 61 |
| median | 71 |
| Q3 | 79 |
| 95-th percentile | 89 |
| Maximum | 98 |
| Range | 79 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 13.75648075 |
|---|---|
| Coefficient of variation (CV) | 0.1988286286 |
| Kurtosis | 0.09917993453 |
| Mean | 69.18762575 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.5712648591 |
| Sum | 275090 |
| Variance | 189.2407626 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 71 | 137 | 1.6% | |
| 70 | 128 | 1.5% | |
| 80 | 120 | 1.4% | |
| 69 | 120 | 1.4% | |
| 74 | 118 | 1.4% | |
| 73 | 118 | 1.4% | |
| 72 | 118 | 1.4% | |
| 75 | 116 | 1.4% | |
| 68 | 116 | 1.4% | |
| 66 | 114 | 1.4% | |
| Other values (68) | 2771 | 33.1% | |
| (Missing) | 4383 | 52.4% |
| Value | Count | Frequency (%) | |
| 19 | 1 | < 0.1% | |
| 20 | 2 | < 0.1% | |
| 23 | 1 | < 0.1% | |
| 24 | 2 | < 0.1% | |
| 25 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 98 | 2 | < 0.1% | |
| 97 | 10 | 0.1% | |
| 96 | 14 | 0.2% | |
| 95 | 11 | 0.1% | |
| 94 | 18 | 0.2% |
| Distinct | 105 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 4383 |
| Missing (%) | 52.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.53998994 |
|---|---|
| Minimum | 4 |
| Maximum | 113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 12 |
| median | 24 |
| Q3 | 40 |
| 95-th percentile | 71 |
| Maximum | 113 |
| Range | 109 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 20.42759043 |
|---|---|
| Coefficient of variation (CV) | 0.7157532456 |
| Kurtosis | 0.7323293219 |
| Mean | 28.53998994 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 1.068743552 |
| Sum | 113475 |
| Variance | 417.2864507 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 4 | 143 | 1.7% | |
| 5 | 135 | 1.6% | |
| 12 | 113 | 1.4% | |
| 17 | 113 | 1.4% | |
| 7 | 108 | 1.3% | |
| 6 | 105 | 1.3% | |
| 9 | 104 | 1.2% | |
| 10 | 102 | 1.2% | |
| 8 | 101 | 1.2% | |
| 18 | 99 | 1.2% | |
| Other values (95) | 2853 | 34.1% | |
| (Missing) | 4383 | 52.4% |
| Value | Count | Frequency (%) | |
| 4 | 143 | 1.7% | |
| 5 | 135 | 1.6% | |
| 6 | 105 | 1.3% | |
| 7 | 108 | 1.3% | |
| 8 | 101 | 1.2% |
| Value | Count | Frequency (%) | |
| 113 | 1 | < 0.1% | |
| 107 | 1 | < 0.1% | |
| 106 | 1 | < 0.1% | |
| 105 | 1 | < 0.1% | |
| 104 | 1 | < 0.1% |
| Distinct | 88 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 3528 |
| Missing (%) | 42.2% |
| Memory size | 65.3 KiB |
| tbd | |
|---|---|
| 8 | 165 |
| 8.2 | 160 |
| 7.8 | 155 |
| 8.3 | 137 |
| Other values (83) |
| Value | Count | Frequency (%) | |
| tbd | 1132 | 13.5% | |
| 8 | 165 | 2.0% | |
| 8.2 | 160 | 1.9% | |
| 7.8 | 155 | 1.9% | |
| 8.3 | 137 | 1.6% | |
| 8.1 | 136 | 1.6% | |
| 8.5 | 130 | 1.6% | |
| 7.9 | 126 | 1.5% | |
| 7.5 | 122 | 1.5% | |
| 7.4 | 115 | 1.4% | |
| Other values (78) | 2453 | 29.3% | |
| (Missing) | 3528 | 42.2% |
Frequencies of value counts
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 0.2% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.889699725 |
| Min length | 1 |
| Distinct | 641 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 4660 |
| Missing (%) | 55.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180.2625034 |
|---|---|
| Minimum | 4 |
| Maximum | 9851 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.3 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 11 |
| median | 28 |
| Q3 | 100 |
| 95-th percentile | 866.2 |
| Maximum | 9851 |
| Range | 9847 |
| Interquartile range (IQR) | 89 |
Descriptive statistics
| Standard deviation | 576.9884653 |
|---|---|
| Coefficient of variation (CV) | 3.200823546 |
| Kurtosis | 94.0185863 |
| Mean | 180.2625034 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 8.255115118 |
| Sum | 666791 |
| Variance | 332915.6891 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 4 | 159 | 1.9% | |
| 6 | 156 | 1.9% | |
| 5 | 156 | 1.9% | |
| 8 | 134 | 1.6% | |
| 7 | 111 | 1.3% | |
| 9 | 101 | 1.2% | |
| 10 | 93 | 1.1% | |
| 11 | 92 | 1.1% | |
| 12 | 73 | 0.9% | |
| 13 | 73 | 0.9% | |
| Other values (631) | 2551 | 30.5% | |
| (Missing) | 4660 | 55.7% |
| Value | Count | Frequency (%) | |
| 4 | 159 | 1.9% | |
| 5 | 156 | 1.9% | |
| 6 | 156 | 1.9% | |
| 7 | 111 | 1.3% | |
| 8 | 134 | 1.6% |
| Value | Count | Frequency (%) | |
| 9851 | 1 | < 0.1% | |
| 9073 | 1 | < 0.1% | |
| 8665 | 1 | < 0.1% | |
| 8003 | 1 | < 0.1% | |
| 7512 | 1 | < 0.1% |
| Distinct | 1126 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 3489 |
| Missing (%) | 41.7% |
| Memory size | 65.3 KiB |
| Capcom | 123 |
|---|---|
| Visual Concepts | 98 |
| TT Games | 72 |
| Nintendo | 72 |
| THQ | 69 |
| Other values (1121) |
| Value | Count | Frequency (%) | |
| Capcom | 123 | 1.5% | |
| Visual Concepts | 98 | 1.2% | |
| TT Games | 72 | 0.9% | |
| Nintendo | 72 | 0.9% | |
| THQ | 69 | 0.8% | |
| Omega Force | 66 | 0.8% | |
| Traveller's Tales | 60 | 0.7% | |
| Yuke's | 54 | 0.6% | |
| High Voltage Software | 46 | 0.6% | |
| Square Enix | 46 | 0.6% | |
| Other values (1116) | 4164 | 49.8% | |
| (Missing) | 3489 | 41.7% |
Frequencies of value counts
Unique
| Unique | 478 ? |
|---|---|
| Unique (%) | 9.8% |
Histogram of lengths of the category
Length
| Max length | 80 |
|---|---|
| Median length | 6 |
| Mean length | 9.198349085 |
| Min length | 2 |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3561 |
| Missing (%) | 42.6% |
| Memory size | 65.3 KiB |
| E | |
|---|---|
| T | |
| M | |
| E10+ | |
| EC | 8 |
| Other values (3) | 3 |
| Value | Count | Frequency (%) | |
| E | 1880 | 22.5% | |
| T | 1404 | 16.8% | |
| M | 772 | 9.2% | |
| E10+ | 731 | 8.7% | |
| EC | 8 | 0.1% | |
| AO | 1 | < 0.1% | |
| K-A | 1 | < 0.1% | |
| RP | 1 | < 0.1% | |
| (Missing) | 3561 | 42.6% |
Frequencies of value counts
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.115803326 |
| Min length | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Name | Platform | Year_of_Release | Genre | Publisher | NA_Sales | EU_Sales | JP_Sales | Other_Sales | Global_Sales | Critic_Score | Critic_Count | User_Score | User_Count | Developer | Rating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | LEGO Batman: The Videogame | Wii | NaN | Action | Warner Bros. Interactive Entertainment | 180 | 97 | 0 | 28 | 306 | 74.0 | 17.0 | 7.9 | 22.0 | Traveller's Tales | E10+ |
| 1 | LEGO Indiana Jones: The Original Adventures | Wii | NaN | Action | LucasArts | 151 | 61 | 0 | 21 | 234 | 78.0 | 22.0 | 6.6 | 28.0 | Traveller's Tales | E10+ |
| 2 | LEGO Batman: The Videogame | PSP | NaN | Action | Warner Bros. Interactive Entertainment | 56 | 44 | 0 | 27 | 128 | 73.0 | 5.0 | 7.4 | 10.0 | Traveller's Tales | E10+ |
| 3 | Combat | 2600 | NaN | Action | Atari | 117 | 7 | 0 | 1 | 125 | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | LEGO Harry Potter: Years 5-7 | Wii | NaN | Action | Warner Bros. Interactive Entertainment | 69 | 42 | 0 | 12 | 124 | 76.0 | 8.0 | 7.8 | 13.0 | Traveller's Tales | E10+ |
| 5 | LEGO Harry Potter: Years 5-7 | X360 | NaN | Action | Warner Bros. Interactive Entertainment | 51 | 37 | 0 | 9 | 97 | 77.0 | 35.0 | 7.9 | 39.0 | Traveller's Tales | E10+ |
| 6 | Yakuza 4 | PS3 | NaN | Action | Sega | 15 | 13 | 63 | 5 | 95 | 78.0 | 59.0 | 8 | 177.0 | Ryu ga Gotoku Studios | M |
| 7 | LEGO Harry Potter: Years 5-7 | PS3 | NaN | Action | Warner Bros. Interactive Entertainment | 36 | 41 | 0 | 15 | 91 | 76.0 | 27.0 | 8.3 | 48.0 | Traveller's Tales | E10+ |
| 8 | The Lord of the Rings: War in the North | X360 | NaN | Action | Warner Bros. Interactive Entertainment | 52 | 24 | 0 | 8 | 84 | 61.0 | 48.0 | 7.4 | 113.0 | Snowblind Studios | M |
| 9 | The Lord of the Rings: War in the North | PS3 | NaN | Action | Warner Bros. Interactive Entertainment | 25 | 42 | 1 | 13 | 82 | 63.0 | 33.0 | 7 | 100.0 | Snowblind Studios | M |
Last rows
| Name | Platform | Year_of_Release | Genre | Publisher | NA_Sales | EU_Sales | JP_Sales | Other_Sales | Global_Sales | Critic_Score | Critic_Count | User_Score | User_Count | Developer | Rating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8349 | XCOM 2 | PS4 | 2016.0 | Strategy | Take-Two Interactive | 4 | 8 | 0 | 2 | 14 | 88.0 | 28.0 | 8 | 116.0 | Firaxis Games | T |
| 8350 | Total War: WARHAMMER | PC | 2016.0 | Strategy | Sega | 0 | 12 | 0 | 1 | 13 | 86.0 | 77.0 | 7.3 | 556.0 | Creative Assembly | T |
| 8351 | Culdcept Revolt | 3DS | 2016.0 | Strategy | Nintendo | 0 | 0 | 6 | 0 | 6 | NaN | NaN | NaN | NaN | NaN | NaN |
| 8352 | Hearts of Iron IV | PC | 2016.0 | Strategy | Paradox Interactive | 0 | 5 | 0 | 0 | 5 | 83.0 | 36.0 | 6.9 | 306.0 | Paradox Development Studio | NaN |
| 8353 | XCOM 2 | XOne | 2016.0 | Strategy | Take-Two Interactive | 2 | 2 | 0 | 0 | 5 | 87.0 | 17.0 | 8.1 | 40.0 | Firaxis Games | T |
| 8354 | Stellaris | PC | 2016.0 | Strategy | Paradox Interactive | 0 | 4 | 0 | 0 | 4 | 78.0 | 57.0 | 8 | 569.0 | Paradox Development Studio | NaN |
| 8355 | Total War Attila: Tyrants & Kings | PC | 2016.0 | Strategy | Koch Media | 0 | 1 | 0 | 0 | 1 | NaN | NaN | NaN | NaN | NaN | NaN |
| 8356 | Brothers Conflict: Precious Baby | PSV | 2017.0 | Action | Idea Factory | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN | NaN | NaN | NaN |
| 8357 | Phantasy Star Online 2 Episode 4: Deluxe Package | PS4 | 2017.0 | Role-Playing | Sega | 0 | 0 | 4 | 0 | 4 | NaN | NaN | NaN | NaN | NaN | NaN |
| 8358 | Phantasy Star Online 2 Episode 4: Deluxe Package | PSV | 2017.0 | Role-Playing | Sega | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN | NaN | NaN | NaN |